A method of counts-in-cells analysis of the galaxy distribution is investigated with
arbitrary smoothing functions in obtaining the galaxy counts. We explore the possibility
of optimizing the smoothing function, considering a series of m-weight Epanechnikov
kernels. The popular top-hat and Gaussian smoothing functions are two special cases
in this series. In this paper, we mainly consider the second moments of counts-in-cells
as a first step. We analytically derive the covariance matrix among different smoothing
scales of cells, taking into account possible overlaps between cells. We find that
the Epanechnikov kernel of m = 1 is better than the top-hat and Gaussian smoothing
functions in estimating cosmological parameters. As an example, we estimate the
expected parameter bounds that come only from the analysis of second moments of
galaxy distributions in a survey similar to the Sloan Digital Sky Survey.

1. Introduction

The large-scale structure of the universe is one of the most powerful cosmological
probes. The structure of the galaxy distribution is a consequence of how our universe began,
how it has evolved with time, and what it is made of. Galaxy distributions are
inherently statistical, and there is no unique way to quantify them. The counts-in-cells
(CIC) analysis, which we consider in this paper, is one of the simplest methods
among them. Once the observed galaxy distribution is quantified by some statistic,
the cosmological parameters are estimated from that statistic.

According to observations of the temperature anisotropy of the cosmic microwave
background radiation (CMB), such as those by the Wilkinson Microwave Anisotropy Probe (WMAP)
(Spergel et al. 2003; Spergel et al. 2006), and the magnitude-redshift relation of type Ia
supernovae (SNIa) (Riess et al. 1998; Perlmutter et al. 1999), a cosmological constant is
necessary to account for the behavior of the universe. However, in order to understand the
evolution of structures, it is also important to explore structures at different epochs.
Galaxy distributions probe a more recent universe than observations such as the CMB and SNIa.
Recent galaxy redshift surveys, such as the AAT Two-Degree Field Galaxy Redshift Survey
(2dFGRS) and the Sloan Digital Sky Survey (SDSS), reveal the large-scale structure of the
nearly present-day universe (Colless 2003; York et al. 2000). By combining galaxy distribution
data with other observations such as WMAP, many cosmological parameters have been accurately
estimated (Percival et al. 2001; Tegmark et al. 2004; Seljak et al. 2005).

One recent trend in parameter estimation is to combine a number of independent
observations in order to constrain the parameters as tightly as possible. Another
is to constrain the cosmological parameters from each independent observation separately in
order to cross-check a cosmological model. The latter is important because the assumed
cosmological model is not guaranteed to be correct. For example, dark energy is introduced just
as a parameter, without knowledge of its underlying nature. On the other hand, there are many
efforts to explain the acceleration of the universe without introducing dark energy, for
example by a back reaction due to super-horizon density fluctuations or by modifications of
gravity (Mukhanov, Abramo, Brandenberger 1997; Carroll et al. 2005; Flanagan 2005; Navarro,
Acoleyen 2005; Nojiri, Odintsov 2005; Rasanen 2006).

In estimating cosmological parameters from independent observations, some of the
parameters cannot be determined with good accuracy because of weak signals or
degeneracies between parameters. For example, in parameter estimation from the CMB,
the free-streaming damping due to neutrinos is less significant than
in the present large-scale structure. In parameter estimation from the large-scale
structure, the Silk damping due to baryons and the free streaming due to neutrinos
produce similar damping of the power spectrum on small scales. Each observation
excels at estimating some parameters but is poor at estimating others.

In this paper, we consider parameter estimation using only a CIC analysis of the
large-scale structure. In the CIC analysis, one needs a smoothing function, which is a function
of the distance from the center of each cell. Usually, the top-hat function or the Gaussian
function is used. The top-hat function is often used for counting galaxies in each cell. The
Gaussian smoothing function is used when we need to smooth out small-scale clustering and
obtain the large-scale properties of clustering. For example, Gaztanaga, Yokoyama (1993) used a
Gaussian-like smoothing function for that purpose. Although the top-hat and Gaussian functions
are simple, they are not necessarily optimal for estimating cosmological parameters. In this
study, we investigate the effects of the choice of smoothing function. We introduce a series of
smoothing functions: the m-weight Epanechnikov kernels. The top-hat function and the Gaussian
function are two special cases in this series. Since these kernels have properties intermediate
between those two functions, we expect them to be useful in searching for a better function for
analyzing the large-scale structure. Since the m-weight Epanechnikov kernel has a finite
support, its Fourier counterpart, $W_m(k)$, oscillates as a function of k. This property
suggests the possibility of finding a specific smoothing function that is sensitive to
oscillating features in the power spectrum, such as the baryon acoustic oscillations (BAO).

This paper is organized as follows. In § 2, we briefly summarize the CIC analysis and the 
m-weight Epanechnikov kernels. In § 3, our methods using the covariance matrix in analytic 
form with the Fisher information matrix are explained. In § 4, we present numerical results of 
our study. Conclusions and discussion are given in § 5. 

2. A Counts-in-Cells Analysis and Smoothing Functions 

2.1. A Counts-in-Cells Analysis 

We consider a CIC analysis of the galaxy distribution on large scales. The CIC analysis
is one of the methods used to analyze the large-scale structure. First, a number of cells are
randomly distributed in a survey volume and the number of galaxies is counted in each cell.
Next, the moments, or semi-invariants, are calculated from the numbers of galaxies in those
cells. The CIC analysis has the advantage of being simpler to carry out than more complex
statistics such as the two-point correlation function and the power spectrum. The behavior of
the statistical errors has also been well studied (Szapudi, Colombi 1996).

The CIC analysis was developed by Peebles (1980) and applied to the IRAS and
Stromlo-APM redshift surveys (Efstathiou et al. 1990; Loveday et al. 1992; Efstathiou 1995).
These papers measured the count probability distribution function, $P_N(R)$, which is the
probability that a spherical cell of radius R contains N galaxies. They obtained the second
moment of the distribution function from a likelihood fit of the distribution function to an
analytic log-normal function. More recently, a skewed log-normal function has been used as an
improved analytic function (Colombi 1994; Ueda, Yokoyama 1996; Szapudi, Pan 2004). In this
study, we simply consider the second moments of the galaxy counts, which are directly obtained
by averaging the square of the galaxy counts (Peebles 1980).



The CIC analysis is related to the density field smoothed with a smoothing function
$W(r;R)$:

$$ \delta_R = \frac{3}{4\pi R^3} \int d^3r\, \delta(\mathbf{r})\, W(|\mathbf{r}|;R), \tag{1} $$

where $\delta(\mathbf{r})$ is the 3-dimensional density contrast and $R$ is the smoothing
scale. In this paper, we use the normalization

$$ \frac{3}{4\pi R^3} \int d^3r\, W(|\mathbf{r}|;R) = 1. \tag{2} $$

This normalization is adopted so that the top-hat smoothing function takes the values
$W_{\rm TH}(r;R) = 1$ $(r < R)$, $0$ $(r > R)$. When the function is not a top-hat, this
normalization is just a convention. The second moment of the density contrast is given by
(Cramer 1946)

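$$ \sigma^2(R) = \langle \delta_R^2 \rangle
   = \left( \frac{3}{4\pi R^3} \right)^2 \int d^3r_1\, d^3r_2\,
     \xi(|\mathbf{r}_1 - \mathbf{r}_2|)\, W(|\mathbf{r}_1|;R)\, W(|\mathbf{r}_2|;R)
   = \left( \frac{3}{4\pi R^3} \right)^2 \int \frac{d^3k}{(2\pi)^3}\, P(k)\, W^2(kR), \tag{3} $$

where $\xi(r)$ is the two-point correlation function, $P(k)$ is the power spectrum, and

$$ W(kR) = \int d^3r\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, W(|\mathbf{r}|;R) \tag{4} $$

is the Fourier counterpart of the smoothing function, which is a real function for a spherical
smoothing function. Similarly, the higher-order moments, $\langle\delta_R^3\rangle$,
$\langle\delta_R^4\rangle$, ..., are given by higher-order correlation functions and
polyspectra.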

The density contrast $\delta(\mathbf{r})$ is a smooth function of space, while the count of
galaxies is a discrete number. We consider the case in which galaxies are counted in each cell
with a weight function $W(|\mathbf{r}|;R)$, which is the same function we introduced above as
the smoothing function. Each galaxy is counted with a weight that depends on its distance from
the center of the cell. The count of galaxies $N$ in a cell is defined by

$$ N = \sum_{g:\,{\rm galaxies}} W(|\mathbf{r}_g - \mathbf{r}_c|;R), \tag{5} $$

where $\mathbf{r}_g$ is the position of each galaxy, and $\mathbf{r}_c$ is the center of the
cell. For a top-hat smoothing function, the above quantity corresponds exactly to the number of
galaxies in a sphere of radius $R$, and $N$ is a non-negative integer. For general smoothing
functions, $N$ is not necessarily an integer, despite its notation. Hereafter we call $N$ a
weighted count.
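
The weighted count of equation (5) can be sketched in a few lines of Python. This is only an illustration: the toy galaxy positions, the function names, and the unnormalized quadratic weight used for comparison are assumptions made here, not quantities from this paper.

```python
import math

def top_hat(r, R):
    """Top-hat weight: 1 inside the cell radius R, 0 outside."""
    return 1.0 if r < R else 0.0

def quadratic_shape(r, R):
    """Illustrative centrally weighted profile 1 - r^2/R^2 (left unnormalized)."""
    return 1.0 - (r / R) ** 2 if r < R else 0.0

def weighted_count(galaxies, center, R, weight=top_hat):
    """Sum the weight W(|r_g - r_c|; R) over all galaxy positions."""
    return sum(weight(math.dist(g, center), R) for g in galaxies)

# Hypothetical toy catalogue, positions in arbitrary length units.
galaxies = [(0.0, 0.0, 0.0), (0.5, 0.0, 0.0), (0.0, 2.0, 0.0), (3.0, 3.0, 3.0)]
center = (0.0, 0.0, 0.0)

n_top_hat = weighted_count(galaxies, center, R=1.0)
n_quadratic = weighted_count(galaxies, center, R=1.0, weight=quadratic_shape)
print(n_top_hat, n_quadratic)
```

With the top-hat weight the result is an integer number of galaxies, while the centrally weighted profile generally gives a non-integer weighted count.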

In the case of the top-hat smoothing function, the relations between moments of the smoothed
density contrast and moments of the galaxy counts are given in Peebles (1980). We need to
generalize such relations to the case of a general smoothing function. We derive those
relations in Appendix 1. The second moment of the density contrast is related to the galaxy
counts by equation (A5),

$$ \sigma^2(R) = \frac{\langle N^2 \rangle - \langle N \rangle^2}{\langle N \rangle^2}
   - \frac{\overline{W}}{\langle N \rangle}, \tag{6} $$

where $\langle N \rangle$ is the weighted count averaged over all cells, $\langle N^2 \rangle$
is the second moment of the weighted count, and

$$ \overline{W} = \frac{3}{4\pi R^3} \int d^3r\, \{ W(|\mathbf{r}|;R) \}^2. \tag{7} $$

The right-hand side of equation (6) is given only by observable quantities. The second term is
a shot noise term. Relations between higher-order moments of the density contrast and those of
the weighted counts are given in Appendix 1, although we focus only on the second moments
below.
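
As a minimal sketch of how the second moment is obtained from the weighted counts, the snippet below subtracts the shot noise term from the normalized variance of the counts. It assumes the relation takes the form sigma^2 = (<N^2> - <N>^2)/<N>^2 - W_bar/<N>, which reduces to the familiar top-hat result when W_bar = 1; the function name and the sample counts are hypothetical.

```python
def second_moment(counts, w_bar=1.0):
    """Shot-noise-corrected second moment from a list of weighted counts.

    counts : weighted counts N, one per cell
    w_bar  : kernel-dependent shot noise factor (1 for a top-hat kernel)
    """
    n_cells = len(counts)
    mean = sum(counts) / n_cells
    mean_sq = sum(c * c for c in counts) / n_cells
    return (mean_sq - mean ** 2) / mean ** 2 - w_bar / mean

# Hypothetical counts whose variance equals their mean, as for pure Poisson noise.
sigma2 = second_moment([2.0, 4.0, 6.0, 8.0])
print(sigma2)
```

For these counts the variance is entirely accounted for by the shot noise term, so the estimated clustering signal vanishes.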

2.2. The m-weight Epanechnikov kernel

The choice of the smoothing function $W(r;R)$ remains arbitrary in the analysis explained
above. The top-hat function

$$ W_{\rm TH}(r;R) = \Theta(R - r), \tag{8} $$

where

$$ \Theta(x) = \begin{cases} 1, & (x > 0), \\ 0, & (x < 0), \end{cases} \tag{9} $$

is the Heaviside step function, is usually adopted in the CIC analysis. Another choice is a
Gaussian function,

$$ W_{\rm G}(r;R) = \frac{2}{3\sqrt{2\pi}} \exp\left( -\frac{r^2}{2R^2} \right). \tag{10} $$

In this study, we introduce a series of smoothing functions with properties intermediate
between the two: the m-weight Epanechnikov kernels. This series of functions is defined by

$$ W_m(r;R) = \frac{\Gamma(m + 5/2)}{\Gamma(5/2)\,\Gamma(m + 1)}
   \left( 1 - \frac{r^2}{R^2} \right)^m \Theta(R - r). \tag{11} $$

For a positive weight m, the weight is large near the center of the cell
and small near its edge. In the case m = 0, this kernel is the top-hat function of
equation (8). The original Epanechnikov kernel (Epanechnikov 1969) corresponds to m = 1,
and the m-weight kernel with $m \neq 0, 1$ is a generalization of the original one. The Fourier
counterparts of the Epanechnikov kernels are given by

$$ W_m(kR) = \frac{4\pi R^3}{3}\, \Gamma\left( m + \frac{5}{2} \right)
   \left( \frac{2}{kR} \right)^{m + 3/2} J_{m + 3/2}(kR), \tag{12} $$

where $J_\nu$ is the Bessel function of the first kind.
Since the m-weight Epanechnikov kernel has a finite support, its Fourier counterpart $W_m(kR)$
oscillates as a function of k. One can therefore expect that there may be a specific smoothing
function which is sensitive to a certain physical feature in the power spectrum, such as the
baryon acoustic oscillations (BAO).
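
The oscillating behaviour of the Fourier counterpart can be checked numerically. The sketch below assumes the kernel has the profile (1 - r^2/R^2)^m inside the support, normalized so that its volume integral equals the cell volume 4 pi R^3 / 3 (which gives weight 1 for m = 0); the function names and the simple midpoint quadrature are choices made here for illustration only.

```python
import math

def epanechnikov(r, R, m):
    """Assumed m-weight kernel, normalized so that its volume integral is 4*pi*R^3/3."""
    if r >= R:
        return 0.0
    amp = math.gamma(m + 2.5) / (math.gamma(2.5) * math.gamma(m + 1.0))
    return amp * (1.0 - (r / R) ** 2) ** m

def radial_integral(f, R, steps=2000):
    """Midpoint rule for 4*pi * int_0^R r^2 f(r) dr."""
    h = R / steps
    return 4.0 * math.pi * h * sum(
        ((i + 0.5) * h) ** 2 * f((i + 0.5) * h) for i in range(steps)
    )

def fourier_kernel(k, R, m, steps=2000):
    """Numerical 3D Fourier transform of the kernel, divided by the cell volume."""
    volume = 4.0 * math.pi * R ** 3 / 3.0

    def integrand(r):
        x = k * r
        return epanechnikov(r, R, m) * (math.sin(x) / x if x > 0.0 else 1.0)

    return radial_integral(integrand, R, steps) / volume

def top_hat_window(x):
    """Closed-form top-hat window, 3 (sin x - x cos x) / x^3."""
    return 3.0 * (math.sin(x) - x * math.cos(x)) / x ** 3

R = 10.0
for k in (0.1, 0.3, 0.6, 1.0):
    print(k, fourier_kernel(k, R, 0), top_hat_window(k * R), fourier_kernel(k, R, 1))
```

For m = 0 the numerical transform reproduces the familiar top-hat window, and for larger m the oscillations at high kR are damped more strongly.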

For m-weight Epanechnikov kernels with a large weight m, the weight near the edge
of the cells, $r \sim R$, is very small. Therefore the characteristic radius of the kernel is
not given by the radius of the support $R$. Instead, it is useful to define an effective radius
$R_{\rm eff}$ of the m-weight Epanechnikov kernel as a characteristic radius:

$$ R_{\rm eff} = \left[ \frac{1}{4\pi R^3} \int d^3r\, W_m(|\mathbf{r}|;R)\, r^2 \right]^{1/2}
   = \left( \frac{1}{2m + 5} \right)^{1/2} R. \tag{13} $$

When we take the limit $m \to \infty$ with the effective radius $R_{\rm eff}$ fixed, the
m-weight Epanechnikov kernel reduces to the Gaussian function $W_{\rm G}(r;R_{\rm eff})$
(Matsubara, Szalay 2002). Therefore, the series of m-weight Epanechnikov kernels contains both
the top-hat function (m = 0) and the Gaussian function ($m = \infty$) as special cases.
Although taking a simple limit of $m \to \infty$ results in divergent expressions in this paper
because of our normalization convention, the derived observable quantities are still regular in
this limit.
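
The approach to the Gaussian limit can be illustrated numerically. The sketch below assumes the kernel profile (1 - r^2/R^2)^m and the relation R_eff = R / sqrt(2m + 5), and compares the profile with exp(-r^2 / (2 R_eff^2)) at fixed R_eff; amplitudes are left out so that only the shapes are compared, and all names and sample values are illustrative.

```python
import math

def support_radius(r_eff, m):
    """Support radius R for a given effective radius, assuming R_eff = R / sqrt(2m + 5)."""
    return r_eff * math.sqrt(2.0 * m + 5.0)

def kernel_shape(r, R, m):
    """Unnormalized profile (1 - r^2/R^2)^m inside the support, 0 outside."""
    return (1.0 - (r / R) ** 2) ** m if r < R else 0.0

def gaussian_shape(r, r_eff):
    """Unnormalized Gaussian profile with dispersion r_eff."""
    return math.exp(-r * r / (2.0 * r_eff ** 2))

def gaussian_limit_gap(m, r, r_eff):
    """Absolute difference between the two profiles at radius r."""
    return abs(kernel_shape(r, support_radius(r_eff, m), m) - gaussian_shape(r, r_eff))

r_eff, r = 10.0, 12.0
for m in (1, 10, 100, 1000):
    print(m, gaussian_limit_gap(m, r, r_eff))
```

The difference shrinks steadily as m grows, consistent with the kernel tending to a Gaussian of width R_eff.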



3. The Fisher Matrix Analysis and the Covariance Matrix 

3.1. The Fisher matrix analysis 

The main purpose of this study is to find an appropriate smoothing function. Since the
number of degrees of freedom in choosing the smoothing function is infinite, we restrict
ourselves to searching for an optimal weight m of the m-weight Epanechnikov kernels. To this
end, we use the method of Fisher information analysis. This analysis is a method to
theoretically estimate the expected parameter bounds that can be achieved with an unbiased
estimator (Kendall, Stuart 1996). It is often used to estimate how well one can constrain
cosmological parameters with a given observation (Matsubara 2004; Perotto et al. 2006; Takada,
Komatsu, Futamase 2006, etc.).

The Fisher information matrix $F$ is defined by

$$ F_{ij} = - \left\langle \frac{\partial^2 \ln \mathcal{L}(\boldsymbol{\theta}|\boldsymbol{x})}
   {\partial\theta_i\, \partial\theta_j} \right\rangle, \tag{14} $$

where $\mathcal{L}(\boldsymbol{\theta}|\boldsymbol{x})$ is the likelihood function of the model
parameters when a set of data $\boldsymbol{x}$ is given. The average is taken over the assumed
distribution of data, which is specified by a fiducial model with a set of parameters
$\boldsymbol{\theta}$. When the likelihood function has its maximum at a set of fiducial
model parameters, the first derivatives of the likelihood vanish there. The second derivatives
specify how sharply the likelihood function is peaked at the maximum. Thus, the Fisher matrix
is a useful quantity for estimating how well one can constrain model parameters in a likelihood
analysis.

In fact, when only one parameter $\theta_i$ is estimated with the other parameters fixed,
there is the Cramer-Rao inequality (Kendall, Stuart 1996; Tegmark 1997; Matsubara, Szalay 2002),

$$ \langle (\Delta\theta_i)^2 \rangle \geq \frac{1}{F_{ii}}, \tag{15} $$

where $\Delta\theta_i = \theta_i - \langle\theta_i\rangle$ is the deviation of that parameter,
and $\langle (\Delta\theta_i)^2 \rangle$ is the square of its standard deviation. In the limit
of a very large data set, the inequality becomes an equality. Therefore, the diagonal elements
of a Fisher matrix offer a criterion for how powerful a given analysis is. When multiple
parameters are simultaneously estimated, there is a generalized correspondence,

$$ \langle \Delta\theta_i\, \Delta\theta_j \rangle = (F^{-1})_{ij}, \tag{16} $$

in the limit of a very large data set. Therefore, the parameter bounds expected from a given
data set can be estimated by calculating a Fisher matrix.
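
The difference between fixing the other parameters and estimating them simultaneously can be made concrete with a two-parameter toy case. The sketch below uses the standard large-data-set results, an error of 1/sqrt(F_ii) when the other parameter is fixed and sqrt((F^-1)_ii) when it is marginalized over; the numerical Fisher matrix is hypothetical and not taken from this paper.

```python
import math

def fixed_errors(F):
    """1-sigma errors when the other parameter is held fixed: 1 / sqrt(F_ii)."""
    return [1.0 / math.sqrt(F[i][i]) for i in range(2)]

def marginalized_errors(F):
    """1-sigma errors after marginalizing over the other parameter: sqrt((F^-1)_ii)."""
    det = F[0][0] * F[1][1] - F[0][1] * F[1][0]
    return [math.sqrt(F[1][1] / det), math.sqrt(F[0][0] / det)]

# Hypothetical Fisher matrix with a strong correlation between the two parameters.
F = [[4.0, 3.0],
     [3.0, 4.0]]

print(fixed_errors(F))         # both 0.5
print(marginalized_errors(F))  # both sqrt(4/7), about 0.756
```

The off-diagonal element encodes a degeneracy between the two parameters, which is why the marginalized errors are larger than the fixed-parameter ones.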

According to Bayes' theorem, the likelihood function in equation (14) is given by

$$ \mathcal{L}(\boldsymbol{\theta}|\boldsymbol{x}) =
   \frac{\mathcal{P}(\boldsymbol{x}|\boldsymbol{\theta})\, \mathcal{P}(\boldsymbol{\theta})}
        {\mathcal{P}(\boldsymbol{x})}, \tag{17} $$

where $\mathcal{P}(\boldsymbol{x}|\boldsymbol{\theta})$ is the conditional probability
distribution function of the data $\boldsymbol{x}$ for a given set of parameters
$\boldsymbol{\theta}$, and $\mathcal{P}(\boldsymbol{\theta})$ is the prior distribution function
of the parameters. The prior distribution function of the data, $\mathcal{P}(\boldsymbol{x})$,
plays the role of just a normalization in Bayes' theorem.

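We assume below that there is no prior knowledge of the parameters. In
that case, the prior distribution function $\mathcal{P}(\boldsymbol{\theta})$ is constant, and
$\mathcal{L}(\boldsymbol{\theta}|\boldsymbol{x}) \propto \mathcal{P}(\boldsymbol{x}|\boldsymbol{\theta})$
as functions of the parameters $\boldsymbol{\theta}$. In equation (14),
$\mathcal{L}(\boldsymbol{\theta}|\boldsymbol{x})$ can then be replaced by
$\mathcal{P}(\boldsymbol{x}|\boldsymbol{\theta})$. Suppose the distribution of the data
$\boldsymbol{x}$ is given by a multivariate Gaussian function,

$$ \mathcal{P}(\boldsymbol{x}|\boldsymbol{\theta}) = \frac{1}{\sqrt{(2\pi)^N \det C}}
   \exp\left[ -\frac{1}{2} (\boldsymbol{x} - \langle\boldsymbol{x}\rangle)^{\rm T} C^{-1}
   (\boldsymbol{x} - \langle\boldsymbol{x}\rangle) \right], \tag{18} $$

where $N$ is the dimension of the data vector $\boldsymbol{x}$, i.e., the number of data. The
$N \times N$ matrix $C$ is the covariance matrix given by

$$ C(\boldsymbol{\theta}) = \left\langle (\boldsymbol{x} - \langle\boldsymbol{x}\rangle)
   (\boldsymbol{x} - \langle\boldsymbol{x}\rangle)^{\rm T} \right\rangle, \tag{19} $$

where the average is taken over the data $\boldsymbol{x}$, whose distribution is given by a set
of model parameters $\boldsymbol{\theta}$. Thus, the covariance matrix explicitly depends on the
model parameters $\boldsymbol{\theta}$. The Fisher matrix for the multivariate Gaussian
distribution of equation (18) is given by (Tegmark, Taylor, Heavens 1997)

$$ F_{ij} = \frac{1}{2} {\rm Tr}\left[ C^{-1} \frac{\partial C}{\partial\theta_i}
   C^{-1} \frac{\partial C}{\partial\theta_j} \right]
   + \frac{\partial \langle\boldsymbol{x}\rangle^{\rm T}}{\partial\theta_i} C^{-1}
     \frac{\partial \langle\boldsymbol{x}\rangle}{\partial\theta_j}. \tag{20} $$
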
In reality, the distribution of data is generally not a multivariate Gaussian. Even in
that case, the Fisher matrix of equation (20) is still useful for estimating the expected
parameter bounds. In fact, Perotto et al. (2006) compared the Fisher information matrix analysis
with the Markov Chain Monte Carlo (MCMC) method by using a simulation. According to their
results, the Fisher matrix analysis and the MCMC method agree well when the likelihood function
is not much different from a (multivariate) Gaussian function. When the likelihood function
is sufficiently peaked at the maximum, the Fisher matrix analysis gives a good prediction for
the expected parameter bounds.
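
The standard Gaussian Fisher matrix formula of Tegmark, Taylor, Heavens (1997), F_ij = (1/2) Tr[C^-1 dC/dtheta_i C^-1 dC/dtheta_j] + (d<x>/dtheta_i)^T C^-1 (d<x>/dtheta_j), can be evaluated directly once the derivatives of the covariance matrix and of the mean are known. The following is a minimal pure-Python sketch for small matrices; the helper names and the two-parameter toy model are assumptions made for illustration.

```python
def mat_mul(a, b):
    """Product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
            for i in range(len(a))]

def mat_inv(a):
    """Inverse by Gauss-Jordan elimination with partial pivoting."""
    n = len(a)
    aug = [[float(v) for v in row] + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [vr - f * vc for vr, vc in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]

def fisher_gaussian(cov, dcov, dmean):
    """Fisher matrix for Gaussian data.

    cov   : N x N covariance matrix at the fiducial parameters
    dcov  : list of N x N matrices, dC/dtheta_i
    dmean : list of length-N vectors, d<x>/dtheta_i
    """
    n, n_par = len(cov), len(dcov)
    cinv = mat_inv(cov)
    a = [mat_mul(cinv, d) for d in dcov]          # C^-1 dC/dtheta_i
    fisher = [[0.0] * n_par for _ in range(n_par)]
    for i in range(n_par):
        for j in range(n_par):
            prod = mat_mul(a[i], a[j])
            trace = sum(prod[k][k] for k in range(n))
            cinv_dm = [sum(cinv[r][c] * dmean[j][c] for c in range(n)) for r in range(n)]
            mean_term = sum(dmean[i][r] * cinv_dm[r] for r in range(n))
            fisher[i][j] = 0.5 * trace + mean_term
    return fisher

# Toy model: two data points with C = theta_1 * I (theta_1 = 2) and mean theta_2 * (1, 1).
cov = [[2.0, 0.0], [0.0, 2.0]]
dcov = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 0.0], [0.0, 0.0]]]
dmean = [[0.0, 0.0], [1.0, 1.0]]
fisher = fisher_gaussian(cov, dcov, dmean)
print(fisher)
```

In this toy model the first parameter is constrained only through the covariance term and the second only through the mean term, so the Fisher matrix is diagonal.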

3.2. The covariance matrix of second moments 

The Fisher matrix is calculated by equation (20) once a covariance matrix $C$ is obtained
as a function of the model parameters. We use the second moments of the CIC as the set of data
$\boldsymbol{x}$. For a fixed smoothing function, the second moment $\sigma^2(R)$ is a function
of the smoothing radius $R$. We assume that we have a set of data which consists of second
moments with various $R$, and that the model parameters are estimated by a maximum likelihood
analysis. Therefore, the data vector is given by

$$ \boldsymbol{x} = \left[ \sigma^2(R_1), \sigma^2(R_2), \ldots, \sigma^2(R_N) \right]^{\rm T}, \tag{21} $$

where $R_1 < R_2 < \cdots < R_N$ are a given set of smoothing radii. The values of the smoothing
radii are chosen so that the density fluctuations are modeled by linear theory and the
problem can be treated as analytically as possible. Thus the minimum value of the smoothing
radius $R_1$ cannot be arbitrarily small. On the other hand, the smoothing radius cannot be
arbitrarily large because the radius cannot be larger than the survey size.

The number of data, which is the same as the number of smoothing radii, can be
chosen arbitrarily. The larger the number of data, the more tightly the parameters are
constrained. However, second moments with slightly different smoothing radii do not carry fully
independent information. This property is taken into account in our analysis, as we will see
below.

The matrix elements of the covariance matrix we need are therefore given by

$$ C_{ij} = \left\langle \left[ \sigma^2(R_i) - \langle\sigma^2(R_i)\rangle \right]
   \left[ \sigma^2(R_j) - \langle\sigma^2(R_j)\rangle \right] \right\rangle. \tag{22} $$

The calculation of this covariance matrix is not trivial. We develop a method to analytically
calculate this matrix under certain approximations, which is the main technical development
of this work. The details are described in Appendix 2. The statistical distribution of the
second moments depends on the individual properties of galaxy samples, such as the density
of galaxies, the size of the survey volume, etc. These properties are taken into account in the
calculation of the covariance matrix. In the calculation, we allow cells to overlap one
another. Therefore, the number of cells placed in the survey volume
is not limited. We adopt the approximation that the survey volume has a spherical shape with
radius $L$. The boundary effects of the survey volume are neglected. These approximations make
the calculation analytically tractable (Appendix 2).

4. Numerical Results 

4.1. Assumed galaxy surveys and cosmological parameters

The first sample we consider is similar to the main galaxy sample of
SDSS. We refer to this sample as MG. In this sample, the number density of galaxies is set to
$\bar{n} = 3.3 \times 10^{-3}\, (h^{-1}{\rm Mpc})^{-3}$. The survey size is set by a radius
$L = 378\, h^{-1}$ Mpc. The second sample is similar to the luminous red galaxy sample of SDSS.
We refer to this sample as LRG. This sample is characterized by
$\bar{n} = 10^{-4}\, (h^{-1}{\rm Mpc})^{-3}$ and $L = 723\, h^{-1}$ Mpc.
The redshifts of galaxies in both samples are not large and we simply neglect the redshift
evolution of galaxy clustering. We also neglect the bias for simplicity, although the real
samples in the SDSS have a certain bias. The QSO sample in the SDSS has a very sparse density
(Matsubara, Szalay 2002) and we do not consider a sample of this kind. We do not intend MG and
LRG to closely resemble the actual SDSS survey. Instead, we intend to contrast two different
types of sample. Since we neglect the redshift evolution of clustering and the bias, the power
spectra of both samples are identical. The main differences are the survey volume and the
number density, which are decisive factors in cosmological parameter estimation.

In the Fisher analysis, we need to assume the maximum likelihood estimates of the
parameters, which are called fiducial model parameters. We adopt the fiducial parameters
$(\Omega_m, f_B, \Omega_\nu, h, n_s, \sigma_8^2) = (0.3, 0.15, 0, 0.7, 1.0, 1.0)$, where
$\Omega_m$ is the matter density parameter, $f_B$ is the ratio of the baryon density to the
matter density, $\Omega_\nu$ is the neutrino density parameter, $h$ is the Hubble constant
normalized by 100 km/s/Mpc, $n_s$ is the spectral index of the primordial power spectrum, and
$\sigma_8^2$ is the standard normalization of the density fluctuations. The
bias factors of both samples, MG and LRG, are assumed to be unity. In order to obtain the
power spectrum of the present universe, we calculate the transfer function with the CMBfast code
(Seljak, Zaldarriaga 1996). The normalization of the power spectrum is determined by setting
$\sigma_8^2$. To ensure that the typical scale of fluctuations is not in the strongly nonlinear
regime, we only use effective smoothing radii of equation (13) which satisfy
$R_{\rm eff} > 10\, h^{-1}$ Mpc. The largest smoothing radii should be considerably smaller than
the survey size, since we neglect boundary effects of the survey volume. In our calculation
below, the largest radii are chosen as $R_{\rm eff} = 75\, h^{-1}$ Mpc for the MG and
$R_{\rm eff} = 110\, h^{-1}$ Mpc for the LRG. The smoothing radii $R_i$ in equation (21) are
spaced at equal linear intervals. The number of smoothing radii $N$ is varied in the following
analysis. The number of cells placed in the survey volume, $N_{\rm cells}$, is given by
$(L/R_{\rm eff})^3$.
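
The survey setup described above can be sketched as follows: a linearly spaced grid of effective smoothing radii and the corresponding number of cells (L/R_eff)^3 for the two samples. The function names are illustrative, and the choice of 14 radii is made only to keep the printout short.

```python
def radii_grid(r_min, r_max, n):
    """n linearly spaced effective radii from r_min to r_max (inclusive), n >= 2."""
    step = (r_max - r_min) / (n - 1)
    return [r_min + i * step for i in range(n)]

def number_of_cells(survey_radius, r_eff):
    """Number of cells placed in the survey volume, (L / R_eff)^3."""
    return (survey_radius / r_eff) ** 3

# Survey radius L and largest effective radius, both in h^-1 Mpc, as quoted in the text.
samples = {"MG": (378.0, 75.0), "LRG": (723.0, 110.0)}

for name, (L, r_max) in samples.items():
    radii = radii_grid(10.0, r_max, 14)
    print(name, radii[0], radii[-1],
          number_of_cells(L, radii[0]), number_of_cells(L, radii[-1]))
```

The number of cells falls steeply with the smoothing radius, from tens or hundreds of thousands at the smallest radius to of order a hundred at the largest.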

4.2. Effects of smoothing function

In figure 1, we show the second moments $\sigma^2$ and their statistical errors as functions of
$R_{\rm eff}$ for both the MG and the LRG. These are analytically given by equations (3) and
(A45). In this figure, the solid line is the second moment of the fiducial model, which is
common to both samples, the dashed line is the expected error for the MG, and the dot-dashed
line is the error for the LRG. The relatively flat errors on larger scales are dominated by
cosmic variance, which is smaller for the LRG than for the MG because of the larger sampling
volume. The slight decrease of the error in the MG around $100\, h^{-1}$ Mpc is due to the
boundary effect, which is not properly taken into account in our analytic calculation. On scales
where the boundary effect is significant, the error due to the cosmic variance is already larger
than the signal, and we do not have to worry about the boundary effect. The increase of the
errors toward smaller scales is due to the shot noise, which is smaller for the MG because the
MG is denser than the LRG. As mentioned above, we only use the large scales,
$R_{\rm eff} > 10\, h^{-1}$ Mpc, where the errors are dominated by cosmic variance, and thus
the LRG has a larger signal-to-noise ratio than the MG.

In figure 1, there is a slight decrease in the cosmic variance at $\sim 100\, h^{-1}$ Mpc in the
case of the MG sample. This is caused by our approximation of neglecting the boundary effects,
as described in Appendix 2. As a result, when $R$ becomes comparable to $L$, the cosmic variance
is underestimated. However, the error is already dominated by the cosmic variance in this
regime, and this effect does not affect the results.

The number of cells placed in the survey volume, $N_{\rm cells}$, is given by
$(L/R_{\rm eff})^3$, as mentioned in the previous section. This number affects the results
through a factor $1/N_{\rm cells}$ appearing in equation (A45). The first line of equation (A45)
represents the shot noise, the second line the shot noise from overlapping regions, and the
third line the cosmic variance. As the number of cells is increased for both the MG and the LRG,
the shot noise decreases, as mentioned by Gaztanaga, Yokoyama (1993). However, the shot noise
from overlapping regions becomes dominant on small scales, $R_{\rm eff} < 10\, h^{-1}$ Mpc. In
addition, the statistical errors are dominated by the cosmic variance on
$R_{\rm eff} > 10\, h^{-1}$ Mpc, and the factor appearing in the third line of equation (A45) is
< 0.01 at most. Therefore, we find from equation (A45) that the numbers of
cells are sufficient and that increasing the number of cells does not affect the following
results: the statistical error does not decrease by more than 1% if the number of cells is
increased.

On the other hand, as the number of cells is decreased, the scales dominated by the shot noise
become larger, and the signal-to-noise ratio diminishes. For the MG, decreasing the number
of cells to one tenth of the present number does not affect the signal-to-noise ratio very much.
However, decreasing the number to one hundredth significantly lowers the ratio. Therefore, the
number of cells is large enough for the cosmic variance to dominate over the shot noise at
$R_{\rm eff} = 10\, h^{-1}$ Mpc. The shot noise contributes only < 1% of the error on that
scale. For the LRG, decreasing the number to one tenth significantly reduces the signal-to-noise
ratio, since the shot noise is comparable to the cosmic variance on scales of
$R_{\rm eff} \sim 10\, h^{-1}$ Mpc. A number of cells a hundred times larger than the present
number is necessary to suppress the shot noise contribution to the same
level as in the MG at $R_{\rm eff} = 10\, h^{-1}$ Mpc. Therefore, for the LRG, increasing the
number of cells raises the signal-to-noise ratio on scales of $R_{\rm eff} \sim 10\, h^{-1}$ Mpc
and would slightly tighten the constraints on the parameters.

Figure 2 shows how the matter density parameter and the baryon fraction are constrained
by plotting the diagonal elements of the corresponding Fisher matrix as functions of the number
of data, i.e., the number of smoothing radii $N$. This number can be arbitrarily large because
it is artificially chosen. For larger numbers of data, the Fisher diagonal elements
are larger and the expected error bounds become smaller. However, the Fisher diagonal elements
cease to increase for sufficiently large numbers of data, because the second moments at similar
smoothing radii do not carry independent information. Therefore, the saturation values of the
Fisher diagonal elements for large numbers of data indicate the expected error bounds from
the analysis of second moments.

According to this figure, about 400 data points are enough for the MG to obtain nearly
minimum errors for both $\Omega_m$ and $f_B$. That is, increasing the number of data above 400
is not efficient. For the LRG, about 600 data points are enough. The LRG can constrain the
cosmological parameters more tightly than the MG sample, because of its larger volume. For
both the MG and the LRG, the expected errors for the other parameters have similar tendencies.

The saturation values are different for different m, which shows the effect of the smoothing
function on the parameter estimation. The Fisher diagonal elements for both parameters are
largest when we use the m = 1 (original) Epanechnikov kernel. The top-hat and Gaussian
functions are nearly the worst among the m-weight Epanechnikov kernels. The improvements of the
expected errors are about 14% for the LRG and 11% for the MG (in each case). Although the
improvement is not very large, using the Epanechnikov kernel is clearly better than
using the top-hat or Gaussian kernel as a smoothing function. The reason why the Epanechnikov
kernel is better than the top-hat and Gaussian kernels will be discussed in the next section.

This figure suggests another point. For example, the expected errors for N = 400
with the top-hat (m = 0) function are almost the same as those for N = 200 with the m = 1
Epanechnikov function in every case. In this case, optimizing the smoothing function has the
same effect on the parameter estimation as increasing the number of data points. In practice,
increasing the number of data points increases the computational cost of evaluating the
likelihood function. Using the optimized smoothing function has an advantage in this respect.

In Figure 2, the expected errors for weights m = 0 and m = 1 are quite different in both
the MG and the LRG. It is natural to consider non-integer weights in looking for a more optimal
function. Figure 3 shows the dependence of the Fisher diagonal elements for $\Omega_m$ on the
weight m. It follows from this figure that the m = 0.6 kernel is slightly better than the m = 1
kernel for both the MG and the LRG.

To test the effect of BAO in the power spectrum, we compare the Fisher matrix elements
with and without the baryonic component in the power spectrum. Figure 4 shows the result.
Panel (a) shows the dependence of the Fisher matrix element $F_{\Omega_m \Omega_m}$ on the
weight m with the baryonic component. The fiducial model of panel (a) is the same as that
explained in § 4.1. Panel (b) shows the same dependence as panel (a) but in the pure CDM model.
The fiducial parameters in this panel are assumed to be
$(\Omega_m, f_B, \Omega_\nu, h, n_s, \sigma_8^2) = (0.3, 0, 0, 0.7, 1.0, 1.0)$. In panel (b),
we use the fitting formula of Bardeen et al. (1986) to calculate the transfer function in the
pure CDM model. If the BAO played the main role in the improvement of the parameter constraints,
one would expect the pure CDM model to show no dependence on the weight of the
kernel; this is not what we find. Therefore, we conclude that the improvements come
from other causes, such as the overall envelope of the smoothing function or the covariances. In
addition, if the oscillation feature were the main reason, we would expect the constraint on the
baryon fraction parameter to be especially tightened, which is also not found in this study. The
fact that the Epanechnikov kernel of m = 1 gives the tightest constraints for every parameter
also supports this conclusion.

4.3. Expected parameter bounds

Figure 5 and Figure 6 show the expected parameter bounds for the LRG and for the
MG, respectively. In both Figures, solid lines represent the $1\sigma$ confidence ellipses when
we carry out a two-parameter likelihood analysis with all the other parameters fixed. Dashed
lines show the $1\sigma$ confidence ellipses when we carry out a likelihood analysis for all the
parameters at once and marginalize over all parameters but two. In all cases, the LRG can
constrain the parameters more tightly than the MG, because the survey volume is larger in the
LRG.

The expected parameter bounds when all the other parameters are marginalized 
over are much broader than when only two parameters are fitted. The bounds 
for $f_B$, $h$, and $n_s$ are especially broadened. This is mainly because of degeneracies among 
these parameters, which can be seen in Figures 5 and 6. Therefore, simultaneous 
determination of all the parameters considered here is not efficient when only the second moments 
are used. It is preferable to assume that such degenerate parameters are sufficiently 
constrained by other observations. Figure 7 shows the parameter bounds in the case that $h$ 
and $n_s$ are fixed. Dashed lines are $1\sigma$ confidence ellipsoids for the MG; solid lines are 
for the LRG. The expected bounds are much tighter than in the previous figures, and 
the degeneracies between parameters are resolved. 
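
The broadening of the bounds under marginalization can be illustrated with a two-parameter Fisher matrix. The following is a minimal sketch, not the calculation used in this paper: the function name and the numerical values are hypothetical. It uses the standard results that the error on a parameter is $1/\sqrt{F_{11}}$ when the other parameter is fixed and $\sqrt{(F^{-1})_{11}}$ when the other parameter is marginalized over.

```python
import math


def fixed_and_marginalized_errors(f11, f12, f22):
    """Return (sigma_fixed, sigma_marginalized) for parameter 1 of a 2x2 Fisher matrix."""
    det = f11 * f22 - f12 * f12
    if f11 <= 0.0 or det <= 0.0:
        raise ValueError("Fisher matrix must be positive definite")
    sigma_fixed = 1.0 / math.sqrt(f11)   # other parameter held fixed
    sigma_marg = math.sqrt(f22 / det)    # (F^-1)_11 = F22 / det F
    return sigma_fixed, sigma_marg


# Hypothetical, strongly degenerate pair (correlation coefficient 0.95).
fixed, marg = fixed_and_marginalized_errors(1.0, 0.95, 1.0)
print(f"fixed: {fixed:.3f}, marginalized: {marg:.3f}")
```

The two errors coincide when the off-diagonal element vanishes, and the marginalized error grows without bound as the correlation coefficient approaches unity, which is the behavior seen for the degenerate parameters above.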

5. Conclusion and Discussion 

We consider the CIC analysis of the large-scale galaxy distribution. We estimate the 
expected parameter bounds from the second moments by means of a Fisher matrix analysis, 
considering SDSS-like surveys as working examples. In this analysis, we investigate a series 
of smoothing functions, expecting that there is an optimal smoothing function with respect to 
the parameter bounds. We derive an analytic form of the covariance matrix. As a result, we 
find that it is possible to improve the parameter bounds by using an optimized kernel; the 
Epanechnikov kernel gives better parameter bounds than the traditional top-hat and Gaussian 
kernels. As previous work shows, the LRG sample is more suitable than the MG sample for 
parameter estimation because of its larger observed volume. 

We carried out two other tests to investigate why the Epanechnikov kernel 
gives the tightest constraints among all generalized Epanechnikov kernels. First, we tested the 
hypothesis that the oscillation phases in the power spectrum and in the smoothing kernels 
are synchronized with each other. The comparison in Figure 4 suggests that this hypothesis is 
not realistic: comparing the two oscillations directly, we find no clear evidence of phase 
synchronization. 

Second, we tested the hypothesis that the signal-to-noise ratio depends significantly on 
the shape of the kernels. We do not find a significant difference between the top-hat 
smoothing function and the Epanechnikov kernels. Therefore, we have not identified a clear 
reason why the Epanechnikov kernel is the best; this remains to be studied in future 
work. 

From our results, the following points are notable for parameter estimation. About 400 
and 600 different smoothing scales are sufficient to minimize the parameter bounds for the 
MG sample and LRG sample, respectively. Because of the covariance between data with different 
smoothing scales, adding further smoothing radii provides only redundant information. 
In addition, the parameters $\Omega_b$, $h$, and $n_s$ are degenerate in the parameter estimation. This 
degeneracy arises because varying these parameters changes the shape of the 
power spectrum on the scales of interest in similar ways. 

In this analysis, we do not explicitly consider redshift-space distortion effects. In treating 
actual galaxy distributions, radial distances are measured by redshifts, and the 
redshift-space clustering of galaxies is the observable (Kaiser 1987). In equations (A45) and (A46), 
the integrations over $\mathbf{r}_\alpha$ and $\mathbf{r}_\beta$ do not depend on the line-of-sight direction. Thus, the 
integrations of equations (A11) and (A25) still result in equations (A49) and (A51). In addition, 
we can take the z-axis as the line-of-sight direction in integrating over $\mathbf{k}$. We therefore find that 
redshift-space distortion modifies the covariance matrix only through an additional factor in the cosmic 
variance: the last terms of equations (A45) and (A46) are multiplied by the factor 
$(1 + 2\beta/3 + \beta^2/5)$, where $\beta$ is the redshift-space distortion factor, approximately given by 
$\beta = \Omega_m^{0.6}/b$, and $b$ is the linear bias factor. Therefore, the redshift-space effects are absorbed in 
the normalization of the power spectrum. 
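
To give a sense of the size of this factor, the following minimal sketch evaluates $1 + 2\beta/3 + \beta^2/5$ with the common approximation $\beta \simeq \Omega_m^{0.6}/b$. The function name and the input values ($\Omega_m = 0.3$, $b = 1$) are illustrative assumptions, not values taken from this analysis.

```python
def kaiser_boost(omega_m: float, bias: float) -> float:
    """Angle-averaged linear redshift-space boost 1 + 2*beta/3 + beta^2/5."""
    beta = omega_m ** 0.6 / bias   # approximate distortion parameter
    return 1.0 + 2.0 * beta / 3.0 + beta * beta / 5.0


# Illustrative values only: Omega_m = 0.3 and an unbiased tracer (b = 1).
print(round(kaiser_boost(0.3, 1.0), 3))
```

For these inputs the factor is of order 1.4, and it decreases toward unity for strongly biased tracers, for which $\beta$ is small.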

In Appendix 1, we derive relations between higher-order moments and galaxy counts 
for an arbitrary smoothing function. Although we analyze only the second moment in this paper, 
higher-order moments provide additional information on cosmology. One can also 
search for an optimal smoothing function for the higher-order moments in the CIC 
analysis. By combining the second moment and higher-order moments, one can break the 
degeneracy between the bias $b$ and the normalization $\sigma_8^2$ and reduce other degeneracies among 
parameters (Sefusatti et al. 2006). Theoretical predictions for the higher-order moments can be 
found in Bernardeau (1994a); Catelan, Moscardini (1994). For the case involving a bias parameter, 
relations between the reduced moments of galaxies and those of matter are given in Bernardeau 
(1994b). However, an analytical treatment of the covariance matrix for higher-order moments 
will be more involved. In that case, a numerical evaluation of the covariances would be a 
more realistic method. 

In the evaluation of the power spectrum P(k) of the 2dFGRS, 10 simulated 
realizations are used to take into account non-linear effects, redshift-space effects, and geometric 
effects (Percival et al. 2001). To obtain an accurate covariance matrix of P(k), they need about 
1000 realizations. Running that many full N-body simulations is not realistic, so they use realizations 
of random Gaussian fields for that purpose. Alternatively, one can use the PTHALOS code to 
generate mock galaxy distributions based on the combination of a halo model and 
second-order perturbation theory (Scoccimarro, Sheth 2002). In addition, when we carry 
out a likelihood analysis with many parameters, a grid-based likelihood analysis is also 
unrealistic because of the cost in CPU time. Many teams now use the MCMC method 
(Christensen et al. 2001; Verde et al. 2003) to obtain the maximum likelihood parameters in 
a reasonable time. 

We wish to thank Seiko Inoue for her help in earlier stages of this work. We acknowledge 
the CMBfast code which is publicly available. T.M. acknowledges support from the JSPS 
Grant-in-Aid for Scientific Research, 18540260, 2006. 






Appendix 1. Relations Between Reduced Moments and Galaxy Counts with an 
Arbitrary Smoothing Function 

A. 1.1. The second moment 

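In this section, we obtain an expression for the second moment of the density contrast in terms of 
the statistics of the number of galaxies in a cell, following a procedure similar to that of Peebles (1980). 
First, we divide the cell into a series of infinitesimal volumes, $\delta v_a$, as shown in Figure 8. The 
number of galaxies, $n_a$, in an infinitesimal volume $\delta v_a$ is either 0 or 1. Therefore, 

$$\langle n_a \rangle = \langle n_a^2 \rangle = \langle n_a^3 \rangle = \cdots = \bar{n}\,\delta v_a, \tag{A1}$$

where $\bar{n}$ is the mean number density of galaxies in the universe. The count of galaxies $N$ defined 
by equation (5) is given by 

$$N = \sum_a n_a W(|\mathbf{r}_a|;R), \tag{A2}$$

where the origin of the coordinates $\mathbf{r}_a$ is taken at the center of the cell. The mean value of the 
count is given by 

$$\langle N \rangle = \sum_a \langle n_a \rangle W(|\mathbf{r}_a|;R) = \bar{n} \int d^3r_a\, W(|\mathbf{r}_a|;R) = \bar{n} v, \tag{A3}$$

where $v = 4\pi R^3/3$. Similarly, the mean square of $N$ is calculated as 

$$\begin{aligned}
\langle N^2 \rangle &= \Big\langle \sum_a n_a W(|\mathbf{r}_a|;R) \cdot \sum_b n_b W(|\mathbf{r}_b|;R) \Big\rangle \\
&= \sum_a \langle n_a^2 \rangle \{W(|\mathbf{r}_a|;R)\}^2 + \sum_{a \neq b} \langle n_a n_b \rangle W(|\mathbf{r}_a|;R)\, W(|\mathbf{r}_b|;R) \\
&= \bar{n} \int d^3r_a \{W(|\mathbf{r}_a|;R)\}^2 + \bar{n}^2 \int d^3r_a\, d^3r_b \{1 + \xi(|\mathbf{r}_a - \mathbf{r}_b|)\}\, W(|\mathbf{r}_a|;R)\, W(|\mathbf{r}_b|;R) \\
&= \langle N \rangle \bar{W} + \langle N \rangle^2 \left(1 + \frac{1}{v^2} \int d^3r_a\, d^3r_b\, \xi(|\mathbf{r}_a - \mathbf{r}_b|)\, W(|\mathbf{r}_a|;R)\, W(|\mathbf{r}_b|;R)\right).
\end{aligned} \tag{A4}$$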

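Therefore, using equation (3), we obtain a relation between the number of galaxies and the second moment: 

$$\sigma^2(R) = \langle \delta_R^2 \rangle = \frac{\langle N^2 \rangle - \langle N \rangle \bar{W} - \langle N \rangle^2}{\langle N \rangle^2}, \tag{A5}$$

where $\bar{W}$ is given by equation (7). Below, we need a generalized quantity $\bar{W}_n$, which is defined 
by 

$$\bar{W}_n = \frac{1}{v} \int d^3r\, \{W(|\mathbf{r}|;R)\}^{n+1}. \tag{A6}$$

In the case of the m-weight Epanechnikov kernel, this quantity is calculated analytically and is 
given by 

$$\bar{W}_n^{(m)} = \frac{3\sqrt{\pi}}{4}\, \frac{\Gamma(m(n+1)+1)}{\Gamma(m(n+1)+5/2)} \left\{ \frac{4}{3\sqrt{\pi}}\, \frac{\Gamma(m+5/2)}{\Gamma(m+1)} \right\}^{n+1}, \tag{A7}$$

where $\Gamma$ is the gamma function. For the top-hat smoothing function (m = 0), the quantity 
$\bar{W}_n$ is simply unity for any n. 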
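
The closed form for the m-weight Epanechnikov kernel can be checked against a direct radial integration. The sketch below is illustrative only: it assumes the kernel is proportional to $(1 - r^2/R^2)^m$ inside the cell and normalized so that its volume integral equals $v = 4\pi R^3/3$, and all function names are hypothetical.

```python
import math


def kernel_norm(m: float) -> float:
    """Amplitude A_m of W_m = A_m (1 - r^2/R^2)^m such that its volume integral is v."""
    return 4.0 * math.gamma(m + 2.5) / (3.0 * math.sqrt(math.pi) * math.gamma(m + 1.0))


def wbar_closed_form(m: float, n: int) -> float:
    """Gamma-function expression for (1/v) * integral of W_m^(n+1) over the cell."""
    k = m * (n + 1)
    prefactor = 3.0 * math.sqrt(math.pi) / 4.0
    return prefactor * math.gamma(k + 1.0) / math.gamma(k + 2.5) * kernel_norm(m) ** (n + 1)


def wbar_numeric(m: float, n: int, steps: int = 20000) -> float:
    """Midpoint-rule estimate of the same quantity with R = 1."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        w = kernel_norm(m) * (1.0 - x * x) ** m
        total += x * x * w ** (n + 1) * h
    return 3.0 * total   # (4*pi / v) * radial integral, with v = 4*pi/3


print(wbar_closed_form(0.0, 3), wbar_closed_form(1.0, 1), wbar_numeric(1.0, 1))
```

With these assumptions the top-hat case (m = 0) returns unity for every n, as stated above, and the two evaluations agree for m > 0.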

A. 1.2. The Skewness 

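A similar relation can be found for the third moment, or its reduced quantity, the skewness. 
In calculating $\langle N^3 \rangle$, there exists a term with a weight of $\{W(r)\}^2$. We need to define another 
kind of galaxy count by 

$$N^{(p+1)} = \sum_a n_a \{W(|\mathbf{r}_a|;R)\}^{p+1}, \tag{A8}$$

which corresponds to a weighted count with a smoothing function $\{W(r)\}^{p+1}$. The mean value 
of this count is given by 

$$\langle N^{(p+1)} \rangle = \sum_a \langle n_a \rangle \{W(|\mathbf{r}_a|;R)\}^{p+1} = \bar{n} \int d^3r_a \{W(|\mathbf{r}_a|;R)\}^{p+1} = \langle N \rangle \bar{W}_p. \tag{A9}$$

In the following, we also need a quantity, 

$$\begin{aligned}
\langle N^2 \rangle^{(p,q)} &\equiv \langle N^{(p+1)} N^{(q+1)} \rangle \\
&= \bar{n} \int d^3r_a \{W(|\mathbf{r}_a|;R)\}^{p+q+2} + \bar{n}^2 \int d^3r_a\, d^3r_b\, (1 + \xi_{ab}) \{W(|\mathbf{r}_a|;R)\}^{p+1} \{W(|\mathbf{r}_b|;R)\}^{q+1} \\
&= \langle N \rangle \bar{W}_{p+q+1} + \langle N \rangle^2 \bar{W}_p \bar{W}_q \left(1 + \bar{\xi}_R^{(p,q)}\right),
\end{aligned} \tag{A10}$$

where $\xi_{ab} = \xi(|\mathbf{r}_a - \mathbf{r}_b|)$ is the two-point correlation function and we define 

$$\bar{\xi}_R^{(p,q)} = \frac{\int d^3r_a\, d^3r_b\, \xi(|\mathbf{r}_a - \mathbf{r}_b|) \{W(|\mathbf{r}_a|;R)\}^{p+1} \{W(|\mathbf{r}_b|;R)\}^{q+1}}{\int d^3r_a \{W(|\mathbf{r}_a|;R)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b|;R)\}^{q+1}}. \tag{A11}$$
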
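Note that $\bar{\xi}_R^{(0,0)} = \langle \delta_R^2 \rangle = \sigma^2(R)$. We are ready to calculate $\langle N^3 \rangle$ for an arbitrary smoothing 
function. It is given by 

$$\begin{aligned}
\langle N^3 \rangle &= \Big\langle \sum_a n_a W(|\mathbf{r}_a|;R) \cdot \sum_b n_b W(|\mathbf{r}_b|;R) \cdot \sum_c n_c W(|\mathbf{r}_c|;R) \Big\rangle \\
&= \langle N \rangle \bar{W}_2 + 3 \langle N \rangle^2 \bar{W} \left(1 + \bar{\xi}_R^{(1,0)}\right) + \langle N \rangle^3 \left(1 + 3 \bar{\xi}_R^{(0,0)} + \bar{\zeta}_R\right),
\end{aligned} \tag{A12}$$

where $\bar{\zeta}_R$ is an ensemble average of the third power of the smoothed field and corresponds to a 
volume average of the three-point correlation function, $\zeta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c)$: 

$$\bar{\zeta}_R = \frac{1}{v^3} \int d^3r_a\, d^3r_b\, d^3r_c\, \zeta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c)\, W(|\mathbf{r}_a|;R)\, W(|\mathbf{r}_b|;R)\, W(|\mathbf{r}_c|;R) = \langle \delta_R^3 \rangle. \tag{A13}$$

The skewness parameter is defined by 

$$S_3 = \frac{\langle \delta_R^3 \rangle}{\langle \delta_R^2 \rangle^2}. \tag{A14}$$

By using equations (A10) - (A13), we obtain 

$$S_3 = \frac{\langle N \rangle \left( \langle N^3 \rangle - 3 \langle N \rangle \langle N^2 \rangle - 3 \langle N^2 \rangle^{(1,0)} + 3 \langle N \rangle^2 \bar{W} + 2 \langle N \rangle \bar{W}_2 + 2 \langle N \rangle^3 \right)}{\left( \langle N^2 \rangle - \langle N \rangle \bar{W} - \langle N \rangle^2 \right)^2}. \tag{A15}$$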
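
As a simple consistency check of equation (A15), the numerator should vanish for an unclustered (Poisson) distribution, since the shot-noise terms are then the only contributions to the moments. The sketch below verifies this with exact rational arithmetic for top-hat cells, where $\bar{W} = \bar{W}_2 = 1$ and $\langle N^2 \rangle^{(1,0)} = \langle N^2 \rangle$. The function names and the chosen mean count are hypothetical.

```python
from fractions import Fraction


def poisson_raw_moments(lam):
    """First three raw moments of a Poisson-distributed count with mean lam."""
    m1 = lam
    m2 = lam + lam ** 2
    m3 = lam + 3 * lam ** 2 + lam ** 3
    return m1, m2, m3


def skewness_numerator(m1, m2, m3, m2_10, wbar, wbar2):
    """Bracket in the numerator of equation (A15)."""
    return (m3 - 3 * m1 * m2 - 3 * m2_10
            + 3 * m1 ** 2 * wbar + 2 * m1 * wbar2 + 2 * m1 ** 3)


lam = Fraction(7, 2)   # arbitrary mean count per cell
m1, m2, m3 = poisson_raw_moments(lam)
# Top-hat cells: Wbar = Wbar_2 = 1 and <N^2>^(1,0) = <N^2>.
print(skewness_numerator(m1, m2, m3, m2, 1, 1))
```

The denominator of (A15) vanishes in the same limit, so the check only confirms that the shot-noise subtraction in the numerator is complete.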



A. 1.3. The Kurtosis 



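The fourth moment, or its reduced quantity, the kurtosis, is calculated similarly. 
We define 

$$\begin{aligned}
\langle N^3 \rangle^{(p,q,r)} &\equiv \langle N^{(p+1)} N^{(q+1)} N^{(r+1)} \rangle \\
&= \langle N \rangle \bar{W}_{p+q+r+2} + \langle N \rangle^2 \Big[ \bar{W}_{p+q+1} \bar{W}_r \left(1 + \bar{\xi}_R^{(p+q+1,r)}\right) + \bar{W}_{p+r+1} \bar{W}_q \left(1 + \bar{\xi}_R^{(p+r+1,q)}\right) + \bar{W}_p \bar{W}_{q+r+1} \left(1 + \bar{\xi}_R^{(p,q+r+1)}\right) \Big] \\
&\quad + \langle N \rangle^3 \bar{W}_p \bar{W}_q \bar{W}_r \left(1 + \bar{\xi}_R^{(p,q)} + \bar{\xi}_R^{(q,r)} + \bar{\xi}_R^{(p,r)} + \bar{\zeta}_R^{(p,q,r)}\right),
\end{aligned} \tag{A16}$$

where $\bar{\zeta}_R^{(p,q,r)}$ is defined by 

$$\begin{aligned}
\bar{\zeta}_R^{(p,q,r)} &= \int d^3r_a\, d^3r_b\, d^3r_c\, \zeta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c) \{W(|\mathbf{r}_a|;R)\}^{p+1} \{W(|\mathbf{r}_b|;R)\}^{q+1} \{W(|\mathbf{r}_c|;R)\}^{r+1} \\
&\quad \times \left[ \int d^3r_a \{W(|\mathbf{r}_a|;R)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b|;R)\}^{q+1} \int d^3r_c \{W(|\mathbf{r}_c|;R)\}^{r+1} \right]^{-1}.
\end{aligned} \tag{A17}$$

The fourth moment of the galaxy count, $\langle N^4 \rangle$, is obtained as 

$$\begin{aligned}
\langle N^4 \rangle &= \langle N \rangle \bar{W}_3 + 4 \langle N \rangle^2 \bar{W}_2 \left(1 + \bar{\xi}_R^{(2,0)}\right) + 3 \langle N \rangle^2 (\bar{W})^2 \left(1 + \bar{\xi}_R^{(1,1)}\right) \\
&\quad + 6 \langle N \rangle^3 \bar{W} \left(1 + 2 \bar{\xi}_R^{(1,0)} + \sigma_R^2 + \bar{\zeta}_R^{(1,0,0)}\right) + \langle N \rangle^4 \left(1 + 6 \sigma_R^2 + 4 \bar{\zeta}_R + 3 \sigma_R^4 + \bar{\eta}_R\right),
\end{aligned} \tag{A18}$$

where $\bar{\eta}_R$ is a volume average of the four-point correlation function, $\eta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c, \mathbf{r}_d)$, and is given 
by 

$$\bar{\eta}_R = \frac{1}{v^4} \int d^3r_a\, d^3r_b\, d^3r_c\, d^3r_d\, \eta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c, \mathbf{r}_d)\, W(|\mathbf{r}_a|;R)\, W(|\mathbf{r}_b|;R)\, W(|\mathbf{r}_c|;R)\, W(|\mathbf{r}_d|;R) = \langle \delta_R^4 \rangle - 3 \langle \delta_R^2 \rangle^2. \tag{A19}$$

The reduced kurtosis is defined by 

$$S_4 = \frac{\langle \delta_R^4 \rangle - 3 \langle \delta_R^2 \rangle^2}{\langle \delta_R^2 \rangle^3}. \tag{A20}$$

Therefore, by using equations (A10) and (A16) - (A19), we obtain 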



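$$\begin{aligned}
S_4 = \langle N \rangle^2 \Big[ & \langle N^4 \rangle - 4 \langle N \rangle \langle N^3 \rangle + 12 \langle N \rangle^2 \langle N^2 \rangle - 6 \langle N^3 \rangle^{(1,0,0)} - 6 \langle N \rangle^4 - 3 \langle N^2 \rangle^2 \\
& - 8 \bar{W}_2 \langle N \rangle^2 - 3 (\bar{W})^2 \langle N \rangle^2 - 6 \bar{W}_3 \langle N \rangle + 12 \langle N \rangle \langle N^2 \rangle^{(1,0)} + 6 \bar{W} \langle N \rangle \langle N^2 \rangle \\
& - 12 \bar{W} \langle N \rangle^3 + 8 \langle N^2 \rangle^{(2,0)} + 3 \langle N^2 \rangle^{(1,1)} \Big] \left( \langle N^2 \rangle - \langle N \rangle \bar{W} - \langle N \rangle^2 \right)^{-3}.
\end{aligned} \tag{A21}$$

Appendix 2. Derivation of the Covariance Matrix 
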

In this Appendix, we derive an analytic expression for the covariance matrix. Some 
complication arises in taking overlaps of cells into account. We assume a locally Poisson 
process in the overlapping regions for simplicity, following Colombi, Bouchet, Schaeffer (1995); 
Szapudi, Colombi (1996). The definition of the covariance matrix is given by equation (22). In 
the following, we denote the galaxy count with a smoothing radius $R_i$ by $N_i$. Individual cells 
are labeled by $\alpha$, $\beta$, and $N_{i,\alpha}$ is the galaxy count in a cell $\alpha$ which has a smoothing radius $R_i$. The 
label $\alpha$ is used for the group of cells having radius $R_i$, and the label $\beta$ for the group of cells having radius 
$R_j$. 

The second moment of the density contrast is related to galaxy counts by equation (6). 
The mean square of the galaxy count is given by 

$$\langle N_i^2 \rangle = \frac{1}{N_{\mathrm{cell},i}} \sum_{\alpha=1}^{N_{\mathrm{cell},i}} N_{i,\alpha}^2, \tag{A22}$$

where $N_{\mathrm{cell},i}$ is the number of cells of radius $R_i$. We assume that the mean galaxy count $\langle N_i \rangle$ 
is accurately approximated by $\bar{n} v_i$, where $v_i = 4\pi R_i^3/3$, and that the uncertainty of the mean 
number density $\bar{n}$ is sufficiently small. This assumption corresponds to the case in which we know 
a precise selection function from another observation and obtain a sufficiently accurate mean number 
density from that selection function. If we instead estimate the mean number of galaxies from 
the same data set, the above assumption is inaccurate. By combining equations (6), (22), and 
(A22), the expression for the covariance matrix is obtained. After some calculation, we have 



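$$C_{ii} = \frac{1}{\langle N_i \rangle^4} \left[ \frac{1}{N_{\mathrm{cell},i}} \langle N_{i,\alpha}^4 \rangle + \left(1 - \frac{1}{N_{\mathrm{cell},i}}\right) \frac{1}{V^2} \int d^3r_\alpha\, d^3r_\beta\, \langle N_{i,\alpha}^2 N_{i,\beta}^2 \rangle - \langle N_i^2 \rangle^2 \right], \tag{A23}$$

$$C_{ij} = \frac{1}{\langle N_i \rangle^2 \langle N_j \rangle^2} \left[ \frac{1}{V^2} \int d^3r_\alpha\, d^3r_\beta\, \langle N_{i,\alpha}^2 N_{j,\beta}^2 \rangle - \langle N_i^2 \rangle \langle N_j^2 \rangle \right] \quad (i \neq j), \tag{A24}$$

where $V$ indicates the survey volume and $\langle N_{i,\alpha}^2 N_{j,\beta}^2 \rangle$ is the mean value of the product of the squared 
galaxy counts in two different cells. In this study, we consider cells located randomly in 
a spherical survey volume, following Colombi, Bouchet, Schaeffer (1995); Szapudi, Colombi 
(1996). Then, the sums over cells can be replaced by integrations over the survey volume, as 
$\sum_\alpha (\cdots) = (N_{\mathrm{cell},i}/V) \int_V d^3r_\alpha (\cdots)$. However, in equation (A23), we must divide $\langle N_{i,\alpha}^2 N_{i,\beta}^2 \rangle$ into 
two cases depending on whether $\alpha = \beta$ or not. $\langle N_{i,\alpha}^4 \rangle$ is the term that comes from the case $\alpha = \beta$ and 
represents the average of the fourth power of the weighted counts over all cells. The second term 
comes from the case $\alpha \neq \beta$. The additional factor $(1 - 1/N_{\mathrm{cell},i})$ in front of this term 
is present for the following reason: the number of pairs satisfying $\alpha \neq \beta$ is 
$N_{\mathrm{cell},i}^2 - N_{\mathrm{cell},i}$. Therefore, the sum is replaced by the volume integration as $\sum_\alpha \sum_{\beta \neq \alpha} (\cdots) = 
\left[(N_{\mathrm{cell},i}^2 - N_{\mathrm{cell},i})/V^2\right] \int d^3r_\alpha\, d^3r_\beta (\cdots)$. $\langle N_i^2 \rangle$ is the theoretically expected mean square of the number 
count, related to the variance by equation (A5). 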

To calculate each term, we define a notation similar to equation (A11): 

$$\bar{\xi}_{\alpha\beta}^{(p,q)} = \frac{\int d^3r_a\, d^3r_b\, \xi_{ab} \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \{W(|\mathbf{r}_b - \mathbf{r}_\beta|;R_j)\}^{q+1}}{\int d^3r_a \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b - \mathbf{r}_\beta|;R_j)\}^{q+1}}, \tag{A25}$$

where $\xi_{ab} = \xi(|\mathbf{r}_a - \mathbf{r}_b|)$ is the two-point correlation function and $\mathbf{r}_\alpha$ and $\mathbf{r}_\beta$ are the centers of 
the two cells. Equation (A11) considers correlations within the same cell, whereas equation 
(A25) considers correlations between two different cells. We also use similar notations for the three- 
and four-point correlation functions. For correlations within the same cell, we use 
equation (A17) and 

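$$\begin{aligned}
\bar{\eta}_R^{(p,q,r,s)} &= \int d^3r_a\, d^3r_b\, d^3r_c\, d^3r_d\, \eta_{abcd} \{W(|\mathbf{r}_a|;R)\}^{p+1} \{W(|\mathbf{r}_b|;R)\}^{q+1} \{W(|\mathbf{r}_c|;R)\}^{r+1} \{W(|\mathbf{r}_d|;R)\}^{s+1} \\
&\quad \times \left[ \int d^3r_a \{W(|\mathbf{r}_a|;R)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b|;R)\}^{q+1} \int d^3r_c \{W(|\mathbf{r}_c|;R)\}^{r+1} \int d^3r_d \{W(|\mathbf{r}_d|;R)\}^{s+1} \right]^{-1},
\end{aligned} \tag{A26}$$

where we abbreviate the four-point correlation function $\eta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c, \mathbf{r}_d)$ as $\eta_{abcd}$. To consider 
correlations in two different cells whose centers are located at $\mathbf{r}_\alpha$ and $\mathbf{r}_\beta$, we use 

$$\begin{aligned}
\bar{\zeta}_{\alpha\alpha\beta}^{(p,q,r)} &= \int d^3r_a\, d^3r_b\, d^3r_c\, \zeta_{abc} \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \{W(|\mathbf{r}_b - \mathbf{r}_\alpha|;R_i)\}^{q+1} \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{r+1} \\
&\quad \times \left[ \int d^3r_a \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b - \mathbf{r}_\alpha|;R_i)\}^{q+1} \int d^3r_c \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{r+1} \right]^{-1},
\end{aligned} \tag{A27}$$

$$\begin{aligned}
\bar{\zeta}_{\alpha\beta\beta}^{(p,q,r)} &= \int d^3r_a\, d^3r_c\, d^3r_d\, \zeta_{acd} \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{q+1} \{W(|\mathbf{r}_d - \mathbf{r}_\beta|;R_j)\}^{r+1} \\
&\quad \times \left[ \int d^3r_a \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \int d^3r_c \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{q+1} \int d^3r_d \{W(|\mathbf{r}_d - \mathbf{r}_\beta|;R_j)\}^{r+1} \right]^{-1},
\end{aligned} \tag{A28}$$

$$\begin{aligned}
\bar{\eta}_{\alpha\alpha\beta\beta}^{(p,q,r,s)} &= \int d^3r_a\, d^3r_b\, d^3r_c\, d^3r_d\, \eta_{abcd} \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \{W(|\mathbf{r}_b - \mathbf{r}_\alpha|;R_i)\}^{q+1} \\
&\qquad \times \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{r+1} \{W(|\mathbf{r}_d - \mathbf{r}_\beta|;R_j)\}^{s+1} \\
&\quad \times \Big[ \int d^3r_a \{W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i)\}^{p+1} \int d^3r_b \{W(|\mathbf{r}_b - \mathbf{r}_\alpha|;R_i)\}^{q+1} \\
&\qquad \times \int d^3r_c \{W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j)\}^{r+1} \int d^3r_d \{W(|\mathbf{r}_d - \mathbf{r}_\beta|;R_j)\}^{s+1} \Big]^{-1},
\end{aligned} \tag{A29}$$

where we abbreviate the three-point correlation function $\zeta(\mathbf{r}_a, \mathbf{r}_b, \mathbf{r}_c)$ as $\zeta_{abc}$. 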

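The first term in equation (A23) is calculated in the same manner as in Appendix 1. 
Then, we obtain 

$$\begin{aligned}
\langle N_{i,\alpha}^4 \rangle &= \Big\langle \Big( \sum_{a \in \alpha} n_a W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i) \Big)^4 \Big\rangle \\
&= \langle N_i \rangle \bar{W}_3 + 4 \langle N_i \rangle^2 \bar{W}_2 \left(1 + \bar{\xi}_{R_i}^{(2,0)}\right) + 3 \langle N_i \rangle^2 (\bar{W})^2 \left(1 + \bar{\xi}_{R_i}^{(1,1)}\right) \\
&\quad + 6 \langle N_i \rangle^3 \bar{W} \left(1 + \bar{\xi}_{R_i}^{(0,0)} + 2 \bar{\xi}_{R_i}^{(1,0)} + \bar{\zeta}_{R_i}^{(1,0,0)}\right) \\
&\quad + \langle N_i \rangle^4 \left(1 + 6 \bar{\xi}_{R_i}^{(0,0)} + 4 \bar{\zeta}_{R_i}^{(0,0,0)} + 3 \big\{\bar{\xi}_{R_i}^{(0,0)}\big\}^2 + \bar{\eta}_{R_i}^{(0,0,0,0)}\right),
\end{aligned} \tag{A30}$$

where $\sum_{a \in \alpha}$ represents a summation over all infinitesimal volumes $\delta v_a$ in a cell $\alpha$. 
From equation (A5), the last term in both equations (A23) and (A24) is given by 

$$\begin{aligned}
\langle N_i^2 \rangle \langle N_j^2 \rangle &= \langle N_i \rangle \langle N_j \rangle (\bar{W})^2 + \langle N_i \rangle \langle N_j \rangle^2 \bar{W} \left(1 + \bar{\xi}_{R_j}^{(0,0)}\right) + \langle N_i \rangle^2 \langle N_j \rangle \bar{W} \left(1 + \bar{\xi}_{R_i}^{(0,0)}\right) \\
&\quad + \langle N_i \rangle^2 \langle N_j \rangle^2 \left(1 + \bar{\xi}_{R_i}^{(0,0)} + \bar{\xi}_{R_j}^{(0,0)} + \bar{\xi}_{R_i}^{(0,0)} \bar{\xi}_{R_j}^{(0,0)}\right).
\end{aligned} \tag{A31}$$
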

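The derivation of the second term in equation (A23) is complicated because of the overlaps 
of cells. First of all, the integration is divided into two ranges depending on whether the cells 
overlap or not, i.e. 

$$\int (\cdots)\, d^3r_\alpha\, d^3r_\beta = \int_{r_{\alpha\beta} < R_i + R_j} (\cdots)\, d^3r_\alpha\, d^3r_\beta + \int_{r_{\alpha\beta} > R_i + R_j} (\cdots)\, d^3r_\alpha\, d^3r_\beta,$$

where $r_{\alpha\beta} = |\mathbf{r}_\alpha - \mathbf{r}_\beta|$. On the one hand, for cell pairs that do not overlap each other, 
we calculate $\langle N_{i,\alpha}^2 N_{j,\beta}^2 \rangle$ in the same way as equation (A30). Then, we obtain 

$$\begin{aligned}
\langle N_{i,\alpha}^2 N_{j,\beta}^2 \rangle &= \Big\langle \Big( \sum_{a \in \alpha} n_a W(|\mathbf{r}_a - \mathbf{r}_\alpha|;R_i) \Big)^2 \Big( \sum_{c \in \beta} n_c W(|\mathbf{r}_c - \mathbf{r}_\beta|;R_j) \Big)^2 \Big\rangle \\
&= \langle N_i \rangle \langle N_j \rangle (\bar{W})^2 \left(1 + \bar{\xi}_{\alpha\beta}^{(1,1)}\right) + \langle N_i \rangle \langle N_j \rangle^2 \bar{W} \left(1 + \bar{\xi}_{R_j}^{(0,0)} + 2 \bar{\xi}_{\alpha\beta}^{(1,0)} + \bar{\zeta}_{\alpha\beta\beta}^{(1,0,0)}\right) \\
&\quad + \langle N_i \rangle^2 \langle N_j \rangle \bar{W} \left(1 + \bar{\xi}_{R_i}^{(0,0)} + 2 \bar{\xi}_{\alpha\beta}^{(0,1)} + \bar{\zeta}_{\alpha\alpha\beta}^{(0,0,1)}\right) \\
&\quad + \langle N_i \rangle^2 \langle N_j \rangle^2 \Big(1 + \bar{\xi}_{R_i}^{(0,0)} + \bar{\xi}_{R_j}^{(0,0)} + 4 \bar{\xi}_{\alpha\beta}^{(0,0)} + 2 \bar{\zeta}_{\alpha\alpha\beta}^{(0,0,0)} + 2 \bar{\zeta}_{\alpha\beta\beta}^{(0,0,0)} \\
&\qquad\qquad + \bar{\xi}_{R_i}^{(0,0)} \bar{\xi}_{R_j}^{(0,0)} + 2 \big\{\bar{\xi}_{\alpha\beta}^{(0,0)}\big\}^2 + \bar{\eta}_{\alpha\alpha\beta\beta}^{(0,0,0,0)}\Big).
\end{aligned} \tag{A32}$$
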

On the other hand, for cell pairs that overlap each other, we follow 
Colombi, Bouchet, Schaeffer (1995); Szapudi, Colombi (1996). The configuration of 
the cells is shown in figure 9. We assume that the galaxy distribution within overlapping cells is 
a locally Poisson process. We divide each cell into infinitesimal volumes as before and consider 
two cases depending on whether each infinitesimal volume $\delta v_a$ is included in the overlapping region 
or not. Then, the summations in equation (A32) are divided into 

$$\sum_{a,b \in \alpha} = \sum_{a,b \in \mathrm{I}} + \sum_{a,b \in \mathrm{II}} + 2 \sum_{a \in \mathrm{I},\, b \in \mathrm{II}}, \tag{A33}$$

$$\sum_{c,d \in \beta} = \sum_{c,d \in \mathrm{II}} + \sum_{c,d \in \mathrm{III}} + 2 \sum_{c \in \mathrm{II},\, d \in \mathrm{III}}. \tag{A34}$$

We introduce other notations to show the results: 

$$V_{\mathrm{I}}^{(p,q)} = \int_{\mathrm{I}} d^3r_a \{W(|\mathbf{r}_a|;R_i)\}^p \{W(|\mathbf{r}_a - \mathbf{r}_{\alpha\beta}|;R_j)\}^q, \tag{A35}$$

$$V_{\mathrm{II}}^{(p,q)} = \int_{\mathrm{II}} d^3r_a \{W(|\mathbf{r}_a|;R_i)\}^p \{W(|\mathbf{r}_a - \mathbf{r}_{\alpha\beta}|;R_j)\}^q, \tag{A36}$$

$$V_{\mathrm{III}}^{(p,q)} = \int_{\mathrm{III}} d^3r_a \{W(|\mathbf{r}_a|;R_i)\}^p \{W(|\mathbf{r}_a - \mathbf{r}_{\alpha\beta}|;R_j)\}^q, \tag{A37}$$

where each subscript I, II, and III represents an integration over region I, II, and III, 
respectively, and $\mathbf{r}_{\alpha\beta} = \mathbf{r}_\alpha - \mathbf{r}_\beta$. After some calculations, we obtain 
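
For top-hat kernels, which are unity inside the cell, the region integrals defined above reduce to the geometric volumes of regions I, II, and III. The following minimal sketch evaluates them from the standard formula for the volume common to two spheres; the function names are hypothetical, and the sketch is meant only to illustrate the geometry of figure 9.

```python
import math


def sphere_volume(radius: float) -> float:
    return 4.0 * math.pi * radius ** 3 / 3.0


def overlap_volume(r_i: float, r_j: float, d: float) -> float:
    """Volume common to two spheres of radii r_i and r_j whose centers are d apart."""
    if d >= r_i + r_j:
        return 0.0                               # cells do not overlap
    if d <= abs(r_i - r_j):
        return sphere_volume(min(r_i, r_j))      # smaller cell lies inside the larger
    return (math.pi * (r_i + r_j - d) ** 2
            * (d * d + 2.0 * d * (r_i + r_j) - 3.0 * (r_i - r_j) ** 2) / (12.0 * d))


def tophat_region_volumes(r_i: float, r_j: float, d: float):
    """Volumes of regions I, II and III for two top-hat cells."""
    v_overlap = overlap_volume(r_i, r_j, d)
    return sphere_volume(r_i) - v_overlap, v_overlap, sphere_volume(r_j) - v_overlap


print(tophat_region_volumes(1.0, 1.0, 1.0))
```

For kernels other than the top-hat, the integrands carry the kernel weights and the region integrals are no longer purely geometric.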

(N i , a 2 N J /) c 



overlap 



77(2,0)77(0,2) , 77(2,0)77(0,2) , 77(2,0)77(0,2) 

2 



2F (2,1) F (0,D 



2vf ,0 Vi 1,2) + yr uy v r i l 
v 



K2,0)t7(0,2) 



+n 3 



2 n 



(1,1) v 



77(2,1)77(0,1) 

zv n v m 



2F (1,0) F (1,2) 



■77(1,0)^77(0,2) +T 7(2,0) ^(0,1)^ 



1 



n 



F (l,0)^ 77(0,2) ^77(2,0) 



V 



m 



r 



(1,0) V 77(0,2) 



111 



F (2,0) 

17 n 



I" 



(0,1) V 
m 



77(2,0)77(0,1)77(0,1) 
AV 1 



77(1,0)77(1,0)77(0,2) 



,(1,0)77(1,1)77(0,1) 



f(2,0) 



f(0,1)\ 



(1,1)77(1,0)77(0,1) 



F (l,0)\ 77(0,2) 



I ,,77(2,0)77(0,1)77(0,1) 

+ 4F (i,o) F (i,D F (o,o) 



77(1,1)77(1,0)77(0,1) 77(1,0)77(1,0)77(0,2) 

4l/ ii v m + 1 ' n 



f (1,0)t7(0,2)V 



K1,0)t7(0,1)\ 



f (1,0)t7(0,1)V 



F (i,o)^ 2 77(0,1)77(0,1) 



HI 



+2F (1,0) F (1,0) / F (0,l)\ 2 +4F (1,0) F (1,0) F J),1 )FS ,1) + (yimym\ 
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2 tv^Yv^V^ + 2Fj 1 '° ) Fi 1 '° ) (V^ : 



(A38) 



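It is straightforward to evaluate the $V$ terms that are integrated over region II in the above 
equation. For example, by using the Fourier transformation, we have 

$$V_{\mathrm{II}}^{(p,q)} = \int \frac{d^3k}{(2\pi)^3}\, \tilde{W}^{(p)}(kR_i)\, \tilde{W}^{(q)}(kR_j)\, e^{-i\mathbf{k}\cdot\mathbf{r}_{\alpha\beta}} \tag{A39}$$

in the overlapping region, where 

$$\tilde{W}^{(p)}(kR) = \int d^3r\, \{W(|\mathbf{r}|;R)\}^p\, e^{-i\mathbf{k}\cdot\mathbf{r}}. \tag{A40}$$

On the other hand, for the $V$ terms that are integrated over the non-overlapping regions, it is not 
efficient to evaluate equations (A35) and (A37) directly, because these expressions appear to require numerical 
calculations. However, we can analytically subtract the contribution of the 
overlapping region from an integration over the whole cell. First, note that an integration over 
a non-overlapping region does not depend on the kernel shape of the overlapping cell. Therefore, the 
contribution from the overlapping region can be calculated by replacing the overlapping cell by 
a top-hat smoothing function having the same center and radius as the overlapping cell. Then, 
we obtain 

$$\begin{aligned}
V_{\mathrm{I}}^{(p,q)} &= \int d^3r_a \{W(|\mathbf{r}_a|;R_i)\}^p - \int d^3r_a \{W(|\mathbf{r}_a|;R_i)\}^p\, \Theta(R_j - |\mathbf{r}_a - \mathbf{r}_{\alpha\beta}|) \\
&= v_i \bar{W}_{p-1} - \int \frac{d^3k}{(2\pi)^3}\, \tilde{W}^{(p)}(kR_i) \{v_j U_{\mathrm{th}}(kR_j)\}\, e^{-i\mathbf{k}\cdot\mathbf{r}_{\alpha\beta}},
\end{aligned} \tag{A41}$$

$$\begin{aligned}
V_{\mathrm{III}}^{(p,q)} &= \int d^3r_a \{W(|\mathbf{r}_a|;R_j)\}^q - \int d^3r_a \{W(|\mathbf{r}_a|;R_j)\}^q\, \Theta(R_i - |\mathbf{r}_a - \mathbf{r}_{\alpha\beta}|) \\
&= v_j \bar{W}_{q-1} - \int \frac{d^3k}{(2\pi)^3}\, \{v_i U_{\mathrm{th}}(kR_i)\}\, \tilde{W}^{(q)}(kR_j)\, e^{-i\mathbf{k}\cdot\mathbf{r}_{\alpha\beta}},
\end{aligned} \tag{A42}$$

where $v_i = 4\pi R_i^3/3$, $v_j = 4\pi R_j^3/3$, $\Theta$ is the step function, and $U_{\mathrm{th}}(x) = 3(\sin x - x\cos x)/x^3$ is the Fourier transform 
of the top-hat function. 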
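
The closed form of $U_{\mathrm{th}}(x)$ quoted above can be checked numerically. The sketch below compares it with a direct radial integration of the Fourier transform of a unit-radius top-hat, divided by the cell volume. The function names are hypothetical, and the small-argument series is added here only for numerical stability.

```python
import math


def u_tophat(x: float) -> float:
    """Fourier transform of the top-hat window, normalized to 1 at x = 0."""
    if abs(x) < 1e-2:
        return 1.0 - x * x / 10.0   # leading terms of the Taylor series
    return 3.0 * (math.sin(x) - x * math.cos(x)) / x ** 3


def u_tophat_numeric(x: float, steps: int = 20000) -> float:
    """(1/v) * integral of exp(-i k.r) over the unit sphere, via the midpoint rule."""
    h = 1.0 / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        kr = x * r
        j0 = math.sin(kr) / kr if kr != 0.0 else 1.0   # spherical Bessel j0
        total += r * r * j0 * h
    return 3.0 * total


print(u_tophat(2.5), u_tophat_numeric(2.5))
```

The first zero of $U_{\mathrm{th}}$ lies where $\tan x = x$, near $x \simeq 4.49$, which sets the scale of the oscillations in the subtraction terms of equations (A41) and (A42).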

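In the case of the m-weight Epanechnikov kernel, for example, we can use equation (12) 
for p = 1 or q = 1. For p = 2 or q = 2, the square of the m-weight Epanechnikov kernel is related 
to the 2m-weight Epanechnikov kernel by the following relation: 

$$\{W_m(|\mathbf{r}|;R)\}^2 = \frac{4}{3\sqrt{\pi}} \left\{ \frac{\Gamma(m+5/2)}{\Gamma(m+1)} \right\}^2 \frac{\Gamma(2m+1)}{\Gamma(2m+5/2)}\, W_{2m}(|\mathbf{r}|;R). \tag{A43}$$

Therefore, we can use 

$$\tilde{W}_m^{(2)}(kR) = \frac{4}{3\sqrt{\pi}} \left\{ \frac{\Gamma(m+5/2)}{\Gamma(m+1)} \right\}^2 \frac{\Gamma(2m+1)}{\Gamma(2m+5/2)}\, \tilde{W}_{2m}(kR). \tag{A44}$$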
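
The proportionality between the squared m-weight kernel and the 2m-weight kernel can be verified pointwise. The sketch below assumes the kernel form $W_m \propto (1 - r^2/R^2)^m$ inside the cell, normalized so that its volume integral is $4\pi R^3/3$; the function names are hypothetical.

```python
import math


def kernel_norm(m: float) -> float:
    """Amplitude of the m-weight kernel whose volume integral equals 4*pi*R^3/3."""
    return 4.0 * math.gamma(m + 2.5) / (3.0 * math.sqrt(math.pi) * math.gamma(m + 1.0))


def epanechnikov(r: float, radius: float, m: float) -> float:
    """m-weight Epanechnikov kernel; zero outside the cell."""
    x = r / radius
    if x >= 1.0:
        return 0.0
    return kernel_norm(m) * (1.0 - x * x) ** m


def square_coefficient(m: float) -> float:
    """Constant C such that W_m^2 = C * W_2m at every point."""
    return kernel_norm(m) ** 2 / kernel_norm(2.0 * m)


lhs = epanechnikov(0.3, 1.0, 1.0) ** 2
rhs = square_coefficient(1.0) * epanechnikov(0.3, 1.0, 2.0)
print(lhs, rhs)
```

Because the coefficient is a constant, the same factor carries over to the Fourier transforms by linearity.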

By using equations (A23), (A24), (A30) - (A32), (A38) - (A42) and by approximating 
$1 \gg \xi \gg \zeta \sim \xi^2 \gg \eta \sim \xi^3$, we finally obtain 
cell, 



W 3 W 2 IW) W 



(Ntf (Ntf (N t y (N t ) 



where 



+ 7^1 



V 2 \ A^ell.i 7 Jr al3 <2Ri 



1 'i 



(Ni) 4 ^ /overlap 



V 2 \ N ce a }i J Jr a p>2R l 



d 3 r a d 3 r/s 



( W ) 7(1.1) ^ 7(1,0) , ,^0,0) 



d 3 r a d 3 rp ■ 



overlap 



d 3 r a d 3 rp 



V 2 Jr a p>Ri+R. 



(w 



< 



(1,1) 



( N i)( N j) p vm 



W 7(1,0) , W 7(0,1) 
> a/3 



(A45) 



+4e 



(0,0) 

a/3 



(A46) 



d 3 r a d 3 r 



V 2 Jr a0 <Ri+Rj 



(N^iNj) 2 \ J '" /overlap 



n 3 Vi 2 Vj 2 



R ^ 3 f d " k m 2 \kR i )W( 2 \kR j )U T n(k(R i + R j )) 



L 



(27T) 
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f d 3 k 2 ~. D ,~, sink 2 r f d 3 k 3 r ~ D \}ixm D N sin ^ 

x J j2ny w{k2Rl)w{k2R 3 ] H^~ I JZkY {^uih^jWihR,)^— 

(A47) 

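Appendix 3. Convenient Forms of Equations (A11) and (A25) for Numerical 
Calculation 

To calculate equations (A11) and (A25) numerically, it is convenient to proceed further 
analytically. By using the inverse Fourier transformation 

$$\{W(|\mathbf{r}|;R)\}^n = \int \frac{d^3k}{(2\pi)^3}\, \tilde{W}^{(n)}(kR)\, e^{i\mathbf{k}\cdot\mathbf{r}}, \tag{A48}$$

equation (A11) becomes 

$$\bar{\xi}_R^{(p,q)} = \int \frac{d^3k}{(2\pi)^3}\, \frac{\tilde{W}^{(p+1)}(kR)}{v \bar{W}_p}\, \frac{\tilde{W}^{(q+1)}(kR)}{v \bar{W}_q}\, P(k), \tag{A49}$$

where $v = 4\pi R^3/3$. Similarly, equation (A25) becomes 

$$\bar{\xi}_{\alpha\beta}^{(p,q)} = \int \frac{d^3k}{(2\pi)^3}\, \frac{\tilde{W}^{(p+1)}(kR_i)}{v_i \bar{W}_p}\, \frac{\tilde{W}^{(q+1)}(kR_j)}{v_j \bar{W}_q}\, P(k)\, e^{-i\mathbf{k}\cdot(\mathbf{r}_\alpha - \mathbf{r}_\beta)}. \tag{A50}$$
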

Then, the integration over non-overlapped region appeared in equation (A46) becomes 



/2 / d*r a d*r p -& 

V Jr a a>Ri+Ri 



f d*k W^'jkRj W^'jkRj ~ f ih+Jk YfT (UT) , p U 

(A51) 



(2tt) 3 Vi WP vjWi 
because we assume that a survey volume has a spherical shape. 

Fig. 1. The second moment with a top-hat smoothing function and its errors expected from the Fisher matrix 
calculation. Solid line: the common second moment of both samples in the fiducial model; dashed line: 
expected $1\sigma$ error for the MG; dot-dashed line: expected $1\sigma$ error for the LRG. 

Fig. 2. Panel (a): dependence of the matter density parameter element of the Fisher information matrix 
on the number of data points and on the weight of the m-weight Epanechnikov kernel for both 
the MG sample and the LRG sample. Panel (b): the same for the baryon fraction parameter. The upper 
eight lines are for the LRG sample and the lower eight lines are for the MG sample. For each sample, 
solid line: m = 0; upper dashed line: m = 1; upper dot-dashed line: m = 2; dotted line: m = 3; 
dot-dot-dot-dashed line: m = 10; lower dashed line: m = 20; lower dotted line: m = 40; lower dot-dashed 
line: Gaussian window function. 

Fig. 3. The matter density element of the Fisher information matrix for a non-integer weight kernel. 
Panel (a): the MG sample; Panel (b): the LRG sample. 

Fig. 4. Dependence of the Fisher matrix element $F_{\Omega_m \Omega_m}$ on the weight of the kernel. Panel (a): the 
maximum likelihood point is assumed to be the fiducial model; panel (b): the maximum likelihood point is 
assumed to be $(\Omega_m, f_B, \Omega_\nu, h, n_s, \sigma_8^2) = (0.3, 0., 0., 0.7, 1.0, 1.0)$. 

Fig. 5. The minimum variance bounds of the parameters expected from the LRG sample. We use the 
Epanechnikov kernel, i.e. m = 1. The number of data points is six hundred. Solid lines: $1\sigma$ confidence level 
regions when we carry out the two-parameter estimation, showing how pairs of parameters are degenerate 
with each other; dashed lines: $1\sigma$ confidence level regions when we fit all six parameters at once. 

Fig. 6. The minimum variance bounds of the parameters expected from the MG sample. We use the 
Epanechnikov kernel, i.e. m = 1. The number of data points is four hundred. Lines have the same meanings as in 
figure 5. 

Fig. 7. Confidence contours marginalized over four parameters, i.e. the matter density parameter, the 
baryon fraction, the neutrino density parameter and $\sigma_8^2$. Solid lines: $1\sigma$ confidence ellipsoids for the 
LRG sample; dashed lines: $1\sigma$ confidence ellipsoids for the MG sample. 

Fig. 8. Schematic picture of dividing cells into a series of infinitesimal volumes. 

Fig. 9. Schematic picture of two overlapping cells whose centers are $\mathbf{r}_\alpha$ for a cell whose radius is $R_i$ and 
$\mathbf{r}_\beta$ for a cell whose radius is $R_j$. $r_{\alpha\beta} = |\mathbf{r}_\alpha - \mathbf{r}_\beta|$ represents the separation between the two centers. Region I 
contains cell $\alpha$ except the overlapping region, region II is the overlapping region, and region III contains 
cell $\beta$ except the overlapping region. 






